1 2 Accelerating the discontinuous Galerkin method for seismic wave 3 propagation simulations using multiple GPUs with CUDA 4 and MPI 5

نویسنده

  • Liqiang Wang
چکیده

8 Abstract We have successfully ported an arbitrary high9 order discontinuous Galerkin method for solving the three10 dimensional isotropic elastic wave equation on unstruc11 tured tetrahedral meshes to multiple GPU using CUDA and 12 MPI and obtained a speedup factor of about 28.3 for the 13 single-precision version of our codes and a speedup factor 14 of about 14.9 for the double-precision version. The GPU 15 used in the comparisons is NVIDIA Tesla C2070 Fermi, 16 and the CPU used is Intel Xeon W5660. To effectively 17 overlap inter-process communication with computation, we 18 separate the elements on each subdomain into inner and 19 outer elements and complete the computation on outer 20 elements and fill the MPI buffer first. While the MPI 21 messages travel across the network, the GPU performs 22 computation on inner elements, and all other calculations 23 that do not use information of outer elements from neigh24 boring subdomains. A significant portion of the speedup 25 also comes from a customized matrix–matrix multiplica26 tion kernel, which is used extensively throughout our 27 program. Preliminary performance analysis on our parallel 28 GPU codes shows favorable strong and weak scalabilities. 29 30

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MPI- and CUDA- implementations of modal finite difference method for P-SV wave propagation modeling

Among different discretization approaches, Finite Difference Method (FDM) is widely used for acoustic and elastic full-wave form modeling. An inevitable deficit of the technique, however, is its sever requirement to computational resources. A promising solution is parallelization, where the problem is broken into several segments, and the calculations are distributed over different processors. ...

متن کامل

Accelerating high-order WENO schemes using two heterogeneous GPUs

A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...

متن کامل

High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster

We implement a high-order finite-element application, which performs the numerical simulation of seismic wave propagation resulting for instance from earthquakes at the scale of a continent or from active seismic acquisition experiments in the oil industry, on a large cluster of NVIDIA Tesla graphics cards using the CUDA programming environment and non-blocking message passing based on MPI. Con...

متن کامل

Sustained Petascale Performance of Seismic Simulations with SeisSol on SuperMUC

Seismic simulations in realistic 3D Earth models require petaor even exascale computing power to capture small-scale features of high relevance for scientific and industrial applications. In this paper, we present optimizations of SeisSol – a seismic wave propagation solver based on the Arbitrary high-order accurate DERivative Discontinuous Galerkin (ADER-DG) method on fully adaptive, unstructu...

متن کامل

A High-order Discontinuous Galerkin Scheme for Elastic Wave Propagation

In this paper, we introduce a fourth-order leap-frog time scheme combined with a high-order discontinuous Galerkin method for the solution of the elastodynamic equations. The time discretization, obtained via a simple construction based on Taylor developments, provides an accurate scheme for the numerical simulation of seismic wave propagation. Results of the propagation of an eigenmode allow a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013